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Abstract 

Typical receiver processing, targeting always the best achievable bit error rate performance, can 
result in a waste of resources, especially, when the transmission conditions are such that the best 
performance is orders of magnitude better than the required. In this work, a processing framework 
is proposed which allows adjusting the processing requirements to the transmission conditions and the 
required bit error rate. It applies a-posteriori probability receivers operating over multiple-input multiple- 
output channels. It is demonstrated that significant complexity savings can be achieved both at the soft, 
sphere-decoder based detector and the channel decoder with only minor modifications. 

Index Terms 

MIMO systems, soft-output detection, sphere decoding 

I. Introduction 

Multiple-input, multiple-output (MIMO) transmission offers increased spectral efficiency by 
concurrently transmitting multiple streams over the same frequency band. Therefore, MIMO 
systems have been adopted by several upcoming wireless communication standards like IEEE 
802.1 In and IEEE 802. 16e. A plethora of approaches have been proposed for the detection 
and decoding of such systems providing different complexity /performance tradeoffs. The linear 
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detection methods (e.g., zero-forcing, MMSE), which calculate hard outputs and employ hard 
channel decoding are of low complexity but provide reduced performance. Conversely, the 
a-posteriori probability (APP) receivers provide increased performance at the cost of highly 
increased processing requirements [QQ|. 

Although APP receiver processing may be demanded for achieving the required error-rate 
performance over "unfavorable" transmission environments (i.e., over ill-conditioned MIMO 
transmission channels and at a low signal-to-noise ratio (SNR) regime) it may result in a 
significant waste of resources under good transmission scenarios where even less complex 
solutions could provide the required performance. Such a waste of resources, in terms of energy 
consumption and latency, is unavoidable for fixed algorithmic solutions targeting the "worst-case" 
scenario. 

In order to avoid unnecessary complexity, multi-modal receivers employing different detec- 
tion/decoding approaches with respect to the transmission scenario have been proposed 0, 
0. However, such approaches introduce an area overhead which is a function of the supported 
algorithmic approaches (i.e., modes). In addition, they require computationally intensive selection 
strategies for choosing the less complex supported mode from the set of the available ones 
who can provide the target bit error rate (TER) performance. These selection strategies involve 
performance prediction methods for any of the supported modes and have to be repeated any 
time a performance-affecting parameter changes (e.g., the specific channel realization). Therefore, 
the applicability of multi-modal receivers is restricted to those scenarios where the performance 
prediction is both available (i.e., analytical methods for predicting performance exist) and feasible 
(i.e., of low processing overhead). 

The proposed approach assumes a single soft-output detector able to adjust its processing 
requirements to the transmission conditions and the TER. It is based on the depth-first sphere 
decoder (SD) of HI and it can provide detection performance from max-log MAP down to the 
one of order successive interference cancelation 01. In contrast to the aforementioned multi- 
modal approaches, the proposed one provides significant complexity reduction but it does not 
target the minimum detection complexity. Minimizing detector's complexity would require the 
exploitation of the full correcting capabilities of the channel code in order to, finally, reach the 
TER. Therefore, it would prevent from complexity savings at the channel decoder side. 

In the proposed approach, the detection processing requirements are reduced by approximately 
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calculating the soft information of those bits which already meet the TER requirements before 
channel decoding. As it will be explained in detail, such an approximation is feasible since it 
does not significantly affect the performance of the channel decoder. Then, when systematic 
codes are used, selective channel decoding can be performed only for the bits which have not 
already reached the TER before decoding (and their soft information has not been approximated) 
resulting in additional savings at the channel decoder side. Therefore, the overall complexity gains 
of the receiver are finally distributed to both the detector and the channel decoder. In addition, 
the proposed approach avoids the performance prediction burden which is required to minimize 
the detector's complexity for a given code and a given reduced complexity detector. In this way, 
increased practicality and applicability is achieved. 

In order to adjust the SD complexity to the transmission conditions and the TER, the well- 
known concept of log-likelihood ratio (LLR) clipping is employed. This kind of clipping bounds 
the dynamic range of the LLRs and reduces the detector's complexity at the cost of reduced 
performance. The concept of LLR clipping has been originally proposed in HI in order to align 
the detector's complexity to the unavoidable performance loss originating from the fixed-point 
implementation. Therefore, the LLR clipping value was determined by the selected fixed-point 
accuracy (typically selected via extensive simulations which link the fixed-point accuracy to the 
achievable performance). In this work, the concept is extended in the context of scenario-adaptive 
receiver processing. A simple and practical performance driven LLR clipping is proposed in 
order to choose the clipping value "on-demand", according to the TER performance. Adjusting 
the receiver's complexity by changing LLR the clipping value does not introduce any significant 
processing overhead since, typically, clipping is inherent in the depth-first SD approaches similar 
to [U|]. It is noted that the performance driven LLR clipping still incorporates the ability to align 
the detector's complexity to the selected fixed-point accuracy when the accuracy is linked to the 
related bit error-rate performance. 

II. APP Receiver Processing for MIMO Systems 

The soft-output detector operates over several MIMO channel utilizations. It employs the 
received vectors y in order to calculate the a-posteriori soft information of the coded bits. 
The resulting soft information is de-interleaved and fed to the soft-input, soft-output (SISO) 
channel decoder as a-priori information in order to calculate channel decoder's a-posteriori 
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soft information L D . Finally, the decoded bits are calculated from the sign of L D . 

A. Soft Detection in Terms of Sphere Decoding 

In MEMO transmission with M T transmit and M R > M T receive antennas, at the u-th MEMO 
channel utilization, the interleaved coded bits are grouped into blocks B t)U (t = 1, ...,Mt and 
u = 1, U with U being the number of channel utilizations per code block) in order to be 
mapped onto symbols s t)U of a constellation set S of cardinality \S\. The bipolar A;-th bit resides 
in block Br k n og 2 |sn,« an d the blocks B t u are mapped onto the symbols s t>u by a given mapping 
function (e.g., Gray mapping). The corresponding received M R x 1 vector y u is, then, given by 

y u = H M s u + n u , (1) 

where H u is the M R x M T complex channel matrix and s u = [s ljU , s 2)U , sm t ,u] T is the 
transmitted symbol vector. Then, Cf,^ u is the 6-th bit of the i-th entry of s u and the term n u is 
the noise vector, consisting of i.i.d., zero-mean, complex, Gaussian samples with variance 1o\. 

The soft-output detector calculates the a-posteriori LLRs for all the symbols residing in the 
frame to be decoded. Namely, it calculates 

t i \ i ( p [ c b,i,u = +l|y«,HJ\ 

L D {c b ,i,u> = In sr z-, r^-r , V6, z, u. (2) 

\P{c b ,i,u = -l|y„,H u jy 

Assuming that the corresponding bits are statistically independent (due to interleaving) and under 
the max-log approximation, the problem transforms to 

1 2 

L D (c&,i, u ) ~ 7TT min n y« ~ H « s «ll ~ 
2a* sues' 1 

—r min ||y u -H u s u f (3) 

where S^l u are the sub-sets of possible s u symbol sequences having the 6-th bit value of their 
i-th s u entry equal to ±1. 

In order to avoid exhaustive search the problem can be transformed into an equivalent tree- 
search which can be efficiently solved in terms of sphere decoding 0. In detail, the channel 
matrix H u can be QR decomposed into H u = Q U R U , with Q u a unitary M R x M T matrix and 
H u an M T x M T upper triangular matrix with elements Ri j u and real-valued positive diagonal 
entries. Then, under the LLR calculation, (3) transforms to [Q]| 

1 f 2 

Ld ~ ~ y min ||y u -R„s u || - 

fl u b,i,u 
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— mm ||y u -R u s u | 
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where y' u = Q u H y u 



Vim y'i,ui ■■•> v'mt,u 



T 

TV 



B. SISO Channel Decoding 

Similarly to the soft-output detector, after de-interleaving, the resulting soft-information is 
employed to calculate the corresponding a-posteriori information, as 

*P\c k = +l\t A \ 



Ld (cjfc) 



In 



\P[c k = -1\L A J 

where for c being the encoded sequence after de-interleaving with elements c k , (5) becomes 

/ \ / \ 



(5) 



Ld (cfe) 
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(6) 



with C*^ 1 being the set of bit sequences c, of length K, with their /c-th bit equal to ±1. Then 
(6) can be efficiently calculated by the well-known BCJR-MAP algorithm (61, 0. 

From (6) it becomes apparent that the most significantly contributing sequences c to the 

K 



LLR calculation are those with their non-positive m P 



i=l 



Ci\L A \Ci 



terms being close to zero. 



Therefore, these sequences do not contain highly unreliable bits of very low P Cj|L^ (q) , or 
equivalently, bits of high L A value and sign opposite to L A (Sfc)'s. For this reason, saving 
complexity by employing approximate LLR calculation for the highly non-reliable bits is not 
expected to significantly affect the outcome of the channel decoder. Additionally, under the 
approximation of (H 



InP (c k \L A (c k ] 



~ (c k L A (c fc ) - L A (c k/ 



(7) 



which holds for large \L A (c k ) \ values (typically larger than 2) it can be deduced that for 
highly reliable bits (i.e., of high L A (c k ) value and sign equal to the one of L A (c k )) the terms 
In P Ci\L A (q) in (7) equal zero independently of the exact L A value. The last two observations 
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lead to the conclusion that approximate (and thus of lower complexity) calculation of the strong 
soft information, (i.e., of high \L A (c k ) |) is not expected to significantly affect the outcome of 
the SISO channel decoder. 



III. Scenario-Adaptive MAP Receiver Processing 

The proposed approach exploits the ability to calculate the probability of an erroneous hard 
decision of a specific bit by utilizing its soft information. Additionally, as it has been discussed 
in Section II.B, it benefits from the capability to approximate the strong soft information without 
noticeable consequences on the achievable performance. In detail, the proposed approach consists 
of the following steps. 

1) Linking the LLR values to the TER performance 

According to flU the error probability of the hard decision of the bit c with a-posteriori 
LLR L c is 

l + exp(|L(c)|) 

Thus, the bit-error-rate (BER) of a code block can be approximated as 



1 Nj 

A-^E^efcf) (9) 



k=l 



where c± are the Nj information bits. From the above equation it becomes apparent that 
the provided BER is dominated by the bits with small Lo j values. Additionally, 
these are the bits which significantly contribute to the decoding process (see Eq. (6), 
Section II.B). Therefore, no approximation of these weak LLRs is attempted. From (9) it 
also becomes apparent that if all P e (ck) values are lower then the TER, or equivalently, if 
Ld (cfe) > Lter = \n(TER~ l — 1) for all bits, the average performance will also meet 
the TER. Based on the aforementioned observations unnecessary processing which will 
finally result in (c^) > Lter is avoided, while full processing takes place for the rest 
of the bits. 

2) Performing Scenario-Adaptive Soft-Output Detection 

If the soft-output detector's a-posteriori information (which is the a-priori information for 
the SISO channel decoder) of a bit has met the TER constraint it is assumed that its LLR 
value is strong enough that employing an approximate LLR value will not significantly 
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affect the outcome of the SISO channel decoder (see Section II.B). Thus, the complexity 
of the soft-output detector can be reduced by avoiding the extra processing dedicated to 
accurately calculate the soft information of the bits exceeding the TER constraint. This can 
be practically achieved by means of TER performance-driven LLR clipping or, in detail, 
by bounding the SD a-posteriori information in order not to exceed L TER , as it has been 
explained at step 1. 

The proposed approach allows approximations only for the bits which have already reached 
the TER before channel decoding. Then, for those specific bits, the average bit error rate 
performance after channel decoding is expected to be much better than the required TER. 
This observation leads to the discussion in Section I, according to which the proposed 
approach does not target the minimum SD complexity. If this was the case, the average 
error rate performance after channel decoding should just reach TER and it should not be 
better than that. Then, in order to minimize the decoding complexity tighter LLR clipping 
values should be set after predicting the performance gain provided by the channel decoder. 
This performance prediction would increase the computational burden and it would restrain 
the applicability and the practicality of the approach. 
3) Performing Scenario-Adaptive SISO (Systematic) Channel Decoding 

Following the same rationale described at steps 1 and 2, and for systematic channel codes, 
further complexity reduction can take place at the channel decoder side. In detail, target- 
performance-driven, selective SISO decoding can take place only for the bits which do 
not already meet the TER constrained before channel decoding (i.e., for the bits with 
La (cfe) < Lteb)- For the rest hard decisions are taken based on the sign of their a- 
priori information. The way that the selective decoding can be translated to complexity 
savings, is discussed in Section III.B. 

A. Scenario-Adaptive Soft-Output Detection 

Even if the proposed performance-driven LLR clipping under appropriate modifications is 
applicable to most of the SD approaches, the depth-first SD of [[Ij is herein assumed. In 
contrast to the list SD approaches of flU, lfT0l - lfT2l . the SD of [0Q can ensure the (exact) 
max-log MAP performance and, in addition, no modifications are required since the clipping 
procedure is already inherent. However, as also discussed in Section I, the LLR clipping has been 
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originally proposed in order to adjust the detector's complexity to the fixed-point implementation. 
Therefore, in contrast to this work, no discussion has been made on how to choose the clipping 
value, or on how LLR clipping could be used for scenario-adaptive receiver processing. 

The adopted SD employs depth-first tree traversal with Schnorr-Euchner enumeration and 
radius reduction (with infinite initial radius). In order to avoid those redundant calculations which 
are common to the different minimization problems (and tree searches) of (4) the single-tree- 
search approach of (TJ is employed. According to this approach, only one tree search takes place 
but different radii are used for any of the minimization problems. LLR clipping is employed but, 
as already explained, instead of selecting the clipping value in accordance with the fixed-point 
implementation, performance-driven LLR clipping is employed with a clipping value of Lter- 
More details about the SD structure and implementation can be found in flTJ. 

B. Scenario-Adaptive SISO (Systematic) Channel Decoder 

As discussed in Section III, step 3, partial SISO channel decoding can be performed for 
systematic codes, only on the subset of bits not reaching the TER. For log-SISO approaches 
similar to Q, operating in the log domain and employing the max* operator in order to avoid 
the computational expensive multiplications, the expensive operations are not any more the 
corresponding calculations (which mainly become additions). Instead, as it is also shown in 
[fT3l . Ifl4l . the expensive operations are the required energy consuming memory accesses and 
especially the ones related to the state metric storages. The significance effect of those memory 
accesses is also revealed in |[T3l . |fl4|. where in order to reduce them even additional extra 
processing is proposed. In the sequel, equivalently to lfT5l . it is discussed how the selective LLR 
update may result in reduced number of state metric storages. However, this discussion is just 
indicative since the concept of selective SISO channel decoding cannot be quantified to energy 
savings without considering a specific implementation, which is beyond the scope of this work. 

For an 1/2 convolutional code with c Xtt (e) the encoder output bits for a transition e from the 
state s to s' at coding time t (with x = 0, 1) the corresponding L D (c Xyt ) can be expressed as fTl 




max * [5 t {e)\ 



max * [St{e)} 



(10) 



with 



8 t (e) = a t -i[s] + c , t (e)L A (c , t (e)) + c ljt {e)L A (c M (e)) + (3 t [s'] 



(11) 
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and ac t , (3 t being the state metrics obtained through the following forward and backward recursions 

a t (w) = max * a t _i(s) + c^ t (e)L A (c 0)t (e)) + c l t {e)L A (c lit (e)) (12) 

e:s'=w L J 

= max* + c ,t+i(e)L A (c ,t+i(e)) + c M+1 (e)Z A (c M+1 (e))] . (13) 

As discussed in Ifl4ll . the a values can be calculated and overwritten immediately as they 
are not required in future calculations. On the other hand, typically, all (3 metrics need to be 
stored. However, for selective (per bit) channel decoding only the subset of (3 values related to 
the decoded bits needs to be stored, resulting in potential energy consumption savings. Since, as 
we discussed, the energy savings can be quantified only for specific implementations, in Section 
IV we evaluate the potential gains in terms of required stores. 

IV. Simulations 

A 4 x 4 MIMO system is assumed, operating over a spatially and temporally uncorrelated 
Rayleigh flat-fading channel. The encoded bits are mapped onto 16-QAM via Gray coding. A 
systematic (5/7) g recursive convolutional code of rate 1/2 is employed with code block of 1152 
information bits. The log-MAP BCJR algorithm has been employed for SISO channel decoding. 

In Fig. 1 the BER performance of the full proposed scheme (including both scenario-adaptive 
detection and decoding) is depicted for several TER values (10~ 4 , 10~ 3 , 1CT 2 ) compared to the 
typical. As targeted, significant performance degradation is observed only for those signal-to- 
noise (SNR) values which provide BER performance better than the TER. In addition, small, 
unwanted performance degradation can be observed before reaching the TER due to the approx- 
imate nature of (9). 

In Fig. 2 the performance of the proposed scheme with only scenario-adaptive detection and 
full channel decoding is depicted. It is shown that the proposed LLR clipping can preserve the 
optimal performance for SNR ranges up to the one providing the TER one. 

In Fig. 3 the complexity of the SD is depicted in terms of average visited nodes for the several 
TER values. It is shown that significant complexity gains can be achieved over the whole SNR 
range (even when TER has not still be reached) which shows the high efficiency of the LLR 
clipping. For SNR=14 dB and TER=10~ 4 the gain in comparison with the typical solution (i.e., 
without clipping) reaches 92%. Finally, an additional gain from 28-36% can be observed for any 
additional TER increase of one order of magnitude. In Fig. 4 the average required (3 stores are 
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depicted. A reduction of 37% is observed for SNR=14 dB and TER=10 4 , while a gain up to 
35% (for SNR=14 dB) can be observed for any TER increase of one order of magnitude. 

As already discussed in Section I, the proposed method does not target the minimum SD 
complexity since such an approach would include additional burden which would decrease the 
practicality and the applicability of the scheme (see Section III, step 2). However, it exploits 
the additional (from the minimum) processing overhead to decrease the power consumption of 
the channel decoder. In this context, it would be of interest to give a hint on the complexity 
gains originating from those two approaches, even after ignoring the burden and the additional 
processing cost of the required performance prediction. From Fig. 2 it can be seen that if 
prediction methods were available they would select an LLR clipping value of In (1/1CT 2 — 1 ) 



for SNR=12 dB and a TER of 10 3 , while the proposed approach employs an LLR clipping 
value of In (1/10- 3 - 1 ). This results in a complexity overhead of about 23% (see Fig. 3) at 
the SD side. However, in contrast to the minimum SD complexity approach, the proposed one 
allows a reduction of 25% on the number of required j3 stores. In the same way, at 13.7 dB 
and for TER=1CT 4 the optimal LLR clipping value would be again In (l/lCT 2 — 1 ). Then, the 
proposed approach would result in an overhead of 48%, but it would allow a reduction of 51% 
on the number of the required f3 stores. 



A practical and broadly applicable MIMO-APP receiver processing framework has been 
proposed which allows the adjustment of the receiver processing requirements (i.e., of the soft- 
output detector and the SISO channel decoder) to the transmission conditions and the required 
BER. In contrast to receivers supporting multiple detection schemes, the proposed approach does 
not require error prediction and it employs one single soft-output depth-first sphere decoder 
(SD) able to adjust its complexity by means of (BER) performance-driven LLR clipping. Since 
the proposed approach does not target the minimum detector's complexity the corresponding 
processing overhead is exploited to reduce the energy consumption of the SISO channel decoder. 
Despite the simplicity and the easy applicability of the approach, significant complexity savings 
can be observed both at the soft-output detector and the channel decoder. 



V. Conclusion 
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Fig. 1. BER performance with scenario-adaptive detection and decoding for several TER values. 
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4x4 i.i.d. Rayleigh MIMO Channel; 16-QAM (Gray Coding); (5/7^ Conv. Code; Rate 1/2 
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Fig. 2. BER performance with scenario-adaptive detection and full decoding for several TER values. 
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4x4 i.i.d. Rayleigh MIMO Channel; 16-QAM (Gray Coding); (5/7^ Conv. Code; Rate 1/2 
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Fig. 3. Complexity of scenario-adaptive soft-detection in terms of average visited nodes for several TER values. 
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Fig. 4. Average required /3 stores for scenario-adaptive SISO channel decoding and several TER values. 
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